Potential-based multiobjective reinforcement learning approaches to low-impact agents for AI safety
نویسندگان
چکیده
The concept of impact-minimisation has previously been proposed as an approach to addressing the safety concerns that can arise from utility-maximising agents. An impact-minimising agent takes into account potential impact its actions on state environment when selecting actions, so avoid unacceptable side-effects. This paper proposes and empirically evaluates implementation within framework multiobjective reinforcement learning. key contributions are a novel potential-based specifying measure impact, examination variety non-linear action-selection operators achieve acceptable trade-off between achieving agent’s primary task minimising environmental impact. These experiments also highlight unreported issue with noisy estimates for agents using action-selection, which broader implications application
منابع مشابه
Hierarchical Functional Concepts for Knowledge Transfer among Reinforcement Learning Agents
This article introduces the notions of functional space and concept as a way of knowledge representation and abstraction for Reinforcement Learning agents. These definitions are used as a tool of knowledge transfer among agents. The agents are assumed to be heterogeneous; they have different state spaces but share a same dynamic, reward and action space. In other words, the agents are assumed t...
متن کاملPareto-Based Multiobjective AI Planning
Real-world problems generally involve several antagonistic objectives, like quality and cost for design problems, or makespan and cost for planning problems. The only approaches to multiobjective AI Planning rely on metrics, that can incorporate several objectives in some linear combinations, and metric sensitive planners, that are able to give different plans for different metrics, and hence t...
متن کاملThe Potential of Reinforcement Learning for Live Musical Agents
Reinforcement learning has great potential applicability in computer music, particularly for interactive scenarios and the production of effective control policies for systems. This paper considers in particular the case of interactive music systems, where a software agent can be trained online during a rehearsal session with a musician. A small-scale system is described which uses the Sarsa(λ)...
متن کاملLearning to Teach Reinforcement Learning Agents
In this article we study the transfer learning model of action advice under a budget. We focus on reinforcement learning teachers providing action advice to heterogeneous students playing the game of Pac-Man under a limited advice budget. First, we examine several critical factors affecting advice quality in this setting, such as the average performance of the teacher, its variance and the impo...
متن کاملhierarchical functional concepts for knowledge transfer among reinforcement learning agents
this article introduces the notions of functional space and concept as a way of knowledge representation and abstraction for reinforcement learning agents. these definitions are used as a tool of knowledge transfer among agents. the agents are assumed to be heterogeneous; they have different state spaces but share a same dynamic, reward and action space. in other words, the agents are assumed t...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Engineering Applications of Artificial Intelligence
سال: 2021
ISSN: ['1873-6769', '0952-1976']
DOI: https://doi.org/10.1016/j.engappai.2021.104186